The Automated Detection of Racist Discourse in Dutch Social Media
نویسندگان
چکیده
We present two experiments on the automated detection of racist discourse in Dutch social media. In both experiments, multiple classifiers are trained on the same training set. This training set consists of Dutch posts retrieved from two public Belgian social media pages which are likely to attract racist reactions. The posts were labeled as racist or non-racist by multiple annotators, who reached an acceptable agreement score. The different classification models all use the Support Vector Machine algorithm, but use different (sets of) linguistic features, which can be lexical, stylistic or dictionary-based. In the first experiment, the models are evaluated on a test set containing unseen comments retrieved from the same pages as the training set (and thus also skewed towards racism). In the second experiment, the same models from Experiment 1 are tested on an alternative test set, containing more neutral comments, retrieved from the social media page of a Belgian newspaper. In both experiments, the best performing model relies on a dictionary containing different word categories specifically related to racist discourse. It reaches an F-score of 0.47 (exp. 1) and 0.40 (exp. 2) for the racist class and ROC Area Under Curve scores of 0.64 (exp. 1) and 0.73 (exp. 2). The dictionaries, code, and the procedure for requesting the corpus are available at: https://github.com/clips/hades.
منابع مشابه
A Dictionary-based Approach to Racism Detection in Dutch Social Media
We present a dictionary-based approach to racism detection in Dutch social media comments, which were retrieved from two public Belgian social media sites likely to attract racist reactions. These comments were labeled as racist or non-racist by multiple annotators. For our approach, three discourse dictionaries were created: first, we created a dictionary by retrieving possibly racist and more...
متن کاملStructuring Racist Ideologies in Stephen Crane’s “A Dark Brown Dog”: A Critical Discourse Analysis
This paper deals with the study of how racist ideologies are constructed in Crane’s “A Dark Brown Dog” using the CDA framework. Benefitting from the approaching between literature and linguistics, it focuses on the linguistic examination of the (re)construction of whiteness and blackness based on the assumption that racism is: a social, a discursive, and an ideological construct. This tri-dimen...
متن کاملPlatformed antagonism: racist discourses on fake Muslim Facebook pages
This research examines how fake identities on social media create and sustain antagonistic and racist discourses. It does so by analysing 11 Danish Facebook pages, disguised as Muslim extremists living in Denmark, conspiring to kill and rape Danish citizens. It explores how anonymous content producers utilise Facebook’s socio-technical characteristics to construct, what we propose to term as, p...
متن کاملA Separation, an Ideological Rift in the Iranian Society and Culture: Media, Discourse and Ideology
Media can be a good representation of dominant ideologies in society. The analysis of such discourse can shed light on the mental and social structures of people in society. Adopting van Dijk’s (1995) layout of discourse ideology and his (2000) practical and general outline of ideological analysis, this study analyzes the Iranian movie A Separation, the winner of the 84th An...
متن کاملChallenges & Compromises in Spike Lee ’ s
h—This study looks at Spike Lee’s Malcolm X as an important text in understanding Afrocentric perspectives that challenge the ideological stereotypes of mainstream Hollywood film. Malcolm X intervenes between Lee, the filmmaker, and the powerful media industry and is emblematic of the larger discussion of hegemonic and counter-hegemonic views in media culture. This film is not only an interesti...
متن کامل